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Abstract. We consider linear statistics of the scaled zeros of Dirichlet L— 
functions, and show that the first few moments converge to the Gaussian mo- 
ments. The number of Gaussian moments depends on the particular statistic 
considered. The same phenomenon is found in Random Matrix Theory, where 
we consider linear statistics of scaled eigenphases for matrices in the unitary 
group. In that case the higher moments are no longer Gaussian. We conjecture 
that this also happens for Dirichlet L-functions. 



1. Introduction 

Let q be an odd prime and \ a Dirichlet character modulo q. For 9te(s) > 1 the 
Dirichlet L-function L(s,x) is defined as 

iCD-f^-nfi-^)" 1 

n— 1 P 

Each function L(s,x) has an infinite set of non-trivial zeros 1/2 + yy x j which 
can be ordered so that 

• ■ • < 5Ke( 7x ,_ 2 ) < 9te(7x,-i) < < Kefo.i) < ^e( 7x , 2 ) < . . . 

Note that we don't assume the Generalised Riemann Hypothesis (GRH) since we 
allow the "f x .j to be complex. 

Denote by N(T,\) number of non-trivial zeros such that < 9te(7 x ,j) < T. 
Then for fixed T > 0, 

1 ^ /m x T , Q T 



S N ( T >X)~ ^og— asg^oo 



q- 2 ^ ' ^' 2ir ° 2ne 

where the sum is taken over all the q — 2 non-trivial characters modulo the prime 
q, see Titchmarsh |p2| , Siegel [jl9|, Selberg [p^[ . We will therefore scale the zeros 
by defining 

._ logg 
x xJ ■— 2 n 

The purpose of this paper is to consider linear statistics of the low-lying x Xt j. 
Let / be a rapidly decaying even test function, and consider the linear statistic 

oo 

Wf( X ):= £ f(x xJ ) 

j=-oo 
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Linear statistics for low-lying zeros of several families of L-functions were inves- 
tigated systematically by Katz and Sarnak fic| and by Iwaniec, Luo and Sarnak 
|9) where they were called "one-level densities" . We prefer to use the terminology 
"linear statistic" which is traditional in random matrix theory. 

Define the x~average of Wf (x) as 

X=FXa 

In order to understand the distribution of Wf (x) we calculate its first few moments 

W7 1 ) . In §0 we prove that if supp/ C [—2,2], then the mean of Wf(x) is 
q ~ 
f( x ) anc ^ m §11 we snow that the variance converges to J_, \u\f(u) 2 du if 

supp/ C [—1, 1]. In §|| we show that if supp/ C (— 2/m, 2/m) (for m > 2) then 
the first m moments of Wf converge to the first m moments of a normal random 
variable with mean and variance as above. 

If all moments of Wf (x) were Gaussian, then we would be able to conclude that 
Wf was normally distributed. Indeed, it follows from the work of Selberg that 
scaling the j x j by anything much less than logg, leads to a Gaussian distribution. 
However, with scaling on the order of logq, this cannot be the case for all / since 
taking / to be an indicator function, the limiting distribution is discrete. We 
therefore say that Wf displays mock- Gaussian behaviour. 

As suggested by Katz and Sarnak jl0| , one may try to model properties of Wf (x) 
by random matrix theory. Let U be an N x N unitary matrix, with eigenvalues 
e l6n . The statistical distribution of ^9 n has been conjectured to converge to the 
empirical distribution of x x j as q and N both tend to infinity 

Therefore, as an aide to understanding Wf(x) — Yl'jL-oo f( x xd)i one m ight wish 
to calculate the moments of Y^,n=i f(w'^n)- However, since the 6 n are angles, it is 
more natural (and indeed more convenient) to consider the 27r-periodic function 

00 

F N (d):= /(^ + 2 ^)) 

j=— 00 

and model Wf(x) by 

JV 

Z f (U) :=Y, F *Vi) 
3=1 

where U is an N x N unitary matrix with eigenangles 0\, . . . , 9n- Note that the 
scaling N/2ir (the mean density) is equivalent to the scaling for the zeros of 
L-f unctions. 

Our results for Zf(U) are given in §§||j7| Writing E to denote the average over 
the unitary group with Haar measure, then without any restrictions on the support 
of the function /, we prove that E{Zf(U)} = f(x) dx, and that the variance 
tends to 

/oo 
min(l,| W |)/(u) 2 du 
-00 



Observe that this is in complete agreement with the mean and variance of Wf (x) 
if / has the same support restrictions. Furthermore, we show in §6.1 that for any 
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integer to > 2, if supp/ C [— 2/m, 2/m], then 

, m , I if to odd 

lim E{ Z, - E{Z f }) m } = m , m 

JV-oo i 2 ^/^( m /2)! fT if m even 



where <r 2 , the variance, is given in (1.1). These are the moments of a normal random 
variable, so again we see mock-Gaussian behaviour, with the same restrictions on 
the support of / as in Wf(x)- 

To understand the mock-Gaussian behaviour, note that if we had defined 

oo 

4 £) (*)= E f(L(0 + 2nj)) 



where L — * oo subject to — * 0, then Soshnikov (see also Q]) has shown that 
the mean of Zf(U) converges to X-^oo ^( a; ) an< ^ ^hat ^ ne cen t ere d random 
variable Z { f L) - e{z} L) } converges in distribution to a normal random variable 

with mean zero and variance f_ f{u) 2 \u\ du. Our scaling is L = which is 
just outside the range of Soshnikov's result. Indeed, note that the variance ( |l.l[ ) is 
different if supp / % [—1,1]. 

In §0 we show that all moments of Zf can be calculated exactly within ran- 
dom matrix theory, without any restrictions on the support of /. They are given 
by a complicated expression, but are certainly not Gaussian moments in general. 
The moments of Zf(U) grow sufficiently slowly that they uniquely determine its 
distribution. 

Finally, in §|| we apply the results of §§|],[| t° show that, under the assumption 
of GRH, for each q, there exist x such that the height of the lowest zero is less than 
1/4 times the expected height. We also obtain a similar result where a positive 
proportion of L-functions have their first zero less than 0.633 times the expected 
height. 

Linear statistics of the high zeros of a fixed L-function also show mock-Gaussian 
behaviour J8j , having the same moments as the linear statistics of low- lying zeros 
considered in this paper. 

Moments of linear statistics in other classical compact groups, like SO(2N), 
SO(2N + 1) and Sp(2N) also show mock-Gaussian behaviour, Q. Other L- 
functions can be modeled by these groups. For example, in the case of quadratic 
L-functions mock-Gaussian behaviour can be deduced from the work of Rubinstein 



17|. Specifically, if supp/ C ( — — , — ) then the first to moments are Gaussian with 

mean /(0) — f(u) du and variance 4 J^ 2 uf(u) 2 du, exactly as in the group 
Sp(2N). We remark that this is half the unitary range. Assuming GRH, the re- 
sults of Ozliick and Snyder |l5) show that the mean is indeed /(0) — J Q f(u) du so 
long as supp / C (—2, 2), in the sense that 



7>£' 



7logL\ r ' 



^ „ £ fi l^ ) = f{0) _ / f{u) du + o(1) 

d 7 

We note that the arguments given here to study moments of the linear statistic 
show that the "n-level densities" |k], |l7j of this family of L-functions coincide with 
those of the unitary group, in a suitable range, by purely combinatorial arguments. 
Since that is not our purpose here, we leave it for the reader. 
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Throughout all this paper, the Fourier transform is f(u) = f(x)e 2mxu dx, 
and thus the inverse transform is f(x) = J_ f(u)e 2mxu du. 

2. The scaled level density Wf 

2.1. Zeros and the Explicit Formula. A Dirichlet character \ : N — ► C is a 
function such that x( n + 0) = x( n ) f° r all n 'i x( n ) = if n and q have a common 
divisor; and x( mn ) = x( TO )x( n ) f° r all TO ? n - We say \ is the trivial character 
modulo q (denoted xo) if x( n ) = 1 f° r all n coprime to q. We say that x is even if 
X(-1) = 1, and odd if x(-l) = -1. 

The Explicit Formula is the following relation between a sum over zeros of L(s, x) 
and a sum over prime powers. To describe it, let 



a(x) = 



X even 
X odd 



and let h(r) be any even analytic function in the strip — c < 3m(r) < 1 + c (for 
c > 0) such that \h(r)\ < A(l + \r\)-( 1+5 1 (for r e R, A > 0, S > 0). Set 
= 27 I-oo h(r)e~™ dr, so that h(r) = g(u)t 

(2.1) J2 h ^,x) = ^J h(r)(logq + G x (r)) dr 

V-A(n) 



g(logn) (x(n) + x(rc)) 



where 



G x (r) = ^-(i + o(x) + ir) + ^ + a(x) - ir) - ^ log^ 



and the von Mangoldt function A(n) is defined as logp if n = p k is a prime power, 
and zero otherwise. 

2.2. A decomposition of Wf. For test functions / define the scaled level density 
Wf(x) as 

3 

the sum over all nontrivial zeros of L(s, x)- 

Definition 2.1. f(x) is an admissible test functions for Wf(x) if it is a real, 
even function, whose Fourier transform f(u) is compactly supported, and such that 
/(r)«(l + |r|)- 1 - 5 . 

We will transform Wf into a sum over prime powers by using the explicit formula 
for L(s, X ). In Q, take h(r) = f(^r), so that g(u) = ^-J(^- q ), and note 

that the conditions on / easily imply the analyticity and decay condition on h(r) 
in the explicit formula. We then get a decomposition of Wf(x) as 

(2.2) W f ( X )=W f -(x) + W? sc (x) 

where 

Wx) ■= ^ r /(^)(log? + G x (r)) dr 
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and an oscillatory term 

ff r(rf— i^E^/'O <*»)+*»»■ 

The first term Wf(x) gives 

W>&)= f° /(*) d^ + 0(r^-) 
which is asymptotically independent of \- 

3. The expectation of Wj 

The "expectation" of Wf is defined as the average over all q — 2 nontrivial 
characters modulo prime q 



TO, ; = 7^2 E W fW 

Theorem 3.1. Let / be an admissible function, and assume supp(/) C [—2,2]. 
Then as q — > oo, 

w) .=/_> )di+o G4)' 

Proof. We will use the decomposition (2.2) and average over \'- 

(Wf) q = {W f ) q + (Wf sc ) q . 

Since Wf is asymptotically constant for any /, we have 

r°° i 

and thus it will suffice to show that 

(w? sc ) =©(-!-). 

x J 7 1 log 9 

We will show this under the assumption supp(/) C [—2, 2]. 
We have by that 

(Wr% = -r 1 " E ^/(^) (&(»)>, + (X(n)) q 
x J "? log g ' v n q ^ q 

The mean value of x is 

(1, n = 1 mod g 

0, g| n 



— ' ^ ^ 0, 1 mod 



'7 



Thus we find 



/ w osc\ = >p A(n) c-.logn. 2 1_ A(n) c Uogn , 

\ / ; « logq ^ V7 Jl logg j loggg-2 0> ^g?' 

n— 1 mod q n^l.O mod g 
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Assume that supp(/) C [—a, a] for a > 0. Then the sum is over n < q a . Since / 
is bounded, we may replace it by 1 over that range, and we therefore have 

(3-i) {WD « r J - E ^ + r^^E^ 

X 1 /q logg _f-^ , V™ logqg-2 ^ o V" 



n=l mod q ' ' 7i<q a 

n<g Q 



To deal with the first sum in fl3.1| ) one could replace primes by integers by noting 
that A(n) <C logn, obtaining 

1 A(n) 1 logn 

logo log Q \A& 

n=l mod q n=l mod g 

n<q a n <Q a 

1 \ ^ log(mg + 1) 
logq ^ V m 9 + 1 

m<q a 1 

which vanishes for a < 2. However, one gets a slightly stronger result by the Brun- 
Titchmarsh Theorem, fl3|| , which says that if ir(x;q, a) is the number of primes 
p < x, p = a mod (j, where q and a are coprime, then for x > 2q, 

2x 

tp(q) \og{x/q) 

Therefore, for a > 1, 



1 A(n) _L_ logp 

, fin " lnco 



log" , Vn logo _i , VP 

n=l mod q P=i mod q v 

1 logxl da; 



logg7 29 glog(a;/g) 



1 -l+a/2 



< ; — g 

logo 

which vanishes for a < 2. 

To deal with the second sum in (3.1), one could similarly replace primes by 
integers, and note that 

1 1 ^ logn _ __ 1+ct/2 



7aT <<,? 



logqg-2 ( 

which vanishes if a < 2. Again this result can be strengthened by using the Prime 
Number Theorem, since 

11 A(n) 11^ l°gP 



. /t7 lncr n n ^-^ 



loggg-2 0» \ogqq^ a Jp 

n^0,l mod g 

ll/" 9 logx dx 



log q q J 2 V% l°g x 
1 

\ogq 



« -if 11 "' 2 



which vanishes for a < 2. 
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Thus (Wf sc ^ < q \ogq 2 and so if a < 2 we find (w° f sc ^ -► as required. □ 
Remark. Set 

which is like Wf(x) but with the zeros shifted by height t. Averaging over all char- 
acters modulo q, and doing an extra smooth average over t, the expected value of 
W^p converges to J^° 00 f{x) dx without any restriction on the support of f . 

4. The variance of Wf 

Theorem 4.1. Let f be an admissible function and assume supp/ C [—1,1], then 
the variance of Wf tends to 

(4.1) a(ff = [ \u\f(uf du 



Proof. The variance of Wf is, by Theorem |3.l| 

((^-TO 9 ) 2 ) 9 H(^rn+^(i4) 

and by ( fO| ) 

/ (w o SC) 2\ 1 A ("i) A h) -/ lognA ^/ logn 2 \ 

x ((x(«i)x(n 2 )) 9 + (x(ni)x(n2)) q + (x(ni)x(n 2 )) q + (x(«i)x(^2)) g ) 



Now, 



and 



1, niri2 = 1 mod q 
{x{ni)x{n 2 )) q = {x{n 1 n 2 )) q = I 0, n x or n 2 = modg 



— , otherwise 



1, rii = n 2 ^ modg 
(x(«i)x(«2)) 9 = ^ 0, Hi or fi 2 e modq 
4- , otherwise 

q z 

Since we assume supp / C [—1,1], we need only consider rti,?i2 < <?■ Therefore, 
writing ni for the inverse of n\ modulo q, 



2 y4 A(m) A(ni) j/ logni 

n7^2 V^l V^l V l0 S9 
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By the Prime Number Theorem, 



log g *-* ^ // 

n<q 



A(n) V? 



logQ 



and so the term in (4.2) is bounded by fl g g \a ■ 



Now, 



(log?) 2 



9-1 

E 

ni=2 



A(ni) A(ni) j/ logni 
V l°g<7 



log»i 
logg 



V- 1 1 

p,p prime 

= y 1 

p.p prime 

where fc p is defined so that pp — 1 + fc p g. By unique factorization, there are exactly 
two primes less than q which produce a given k p , namely p and p. Therefore there 
can be at most \it(q) different k p , all lying between 1 and q — 1, and so 



(log 9) 



I A(»i) A(wi) ^ 



f 



Tlx V n l 



fc=l 
1 



Vkq+l 



(1 



V<7 V logg 



Inserting this into (4.2) we have 



((wrf) = 



9-1 

E 



A(n) 2 j-.logn 



(logg) 2 ^ in 



\ogq 



f + o 



( 1 



J2 ( l °SP) 2 /( lQ g P )2 i ,o 



\Vl°g9 
1 



(log 9)'^ P 



log?' 



(log?) 2 



q (log a;) 2 c- log a; 2 da; 



log q log x 



%/log<7 



\/logg 



m/(u) 2 du + e> 

'log 2/ log q 

and so we see that, since f(u) is an even function, 



1 



\/log<? 



hm ({W^f) = I HfXufdu 

q — >oo J H I 



as required. 



□ 



Remark. Assuming GRH, Ozliick |l4j shows that the variance of linear statistics 
of the scaled zeros shifted by t and weighted by a smooth function K , converge to 
the weighted form of (4.1) so long as supp/ C (—2,2), when averaged over t and 
over all c haracters of modulus less than q. In fact, random matrix the ory s uggests 
that (4.1) is the correct variance for all admissible functions (Theorem 6.i ). 
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5. The moments of Wf 

We now attempt to understand the distribution of the scaled level density Wf 
around its expected value. We will find that the first few moments of Wf converge 
to those of a Gaussian random variable with mean limW/ = f(x) dx and 
variance 

/>oo 

min(l, \u\) f(u) 2 du . 

Theorem 5.1. Let f be an admissible function, and assume that 
(5.1) supp/ C [—a, a], a > . 

If m < 2/a, then the m-th moment of Wj sc is 

inadwrr) =/^fW (j ^ m ' meven 

<?^°° 9 I 0, m odd 



where o~(f) 2 , the variance, is given in (4.1). 
Proof. By ( |2.3| ), we have 

wnx) = - e ^/(^) w») + «»)) 

This gives 

WW " ( -^'" ft £ O + *■*» 

= £ J(S)(x) 

Sc{l,...,m} 

where for any subset of indices SC. {1, . . . , m}, the summand J(S)(x) corresponds 
to the different ways of picking x and x : 

j(s)(x)-.=(-^-r e n^/(^)x(n^)x(n^). 

logo ' J -- L ^/n7 1°S<? 7. r" 

Now average over the nontrivial characters Xj using 

/ \ f !. Flies % = Il^s n i ^ mod 9 

(x(IJraj)x(n n< )/ = \ °> any one of = mod g 

\ ^-s / 9 l-l/(g-2), otherwise 



This gives 



with 



<(^r(x)) m > 9 = e 

SC{l,...,m} 



(5.2) { j(5)>, = (-J-r e n^/(^) 

log (7 J - J - ,/n? logo 



(if JL Y A ^f( logn )\ 
\<? \ lo g° „ y/n !ogo y 
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To bound the remainder term, use supp/ C [—a, a] with a < 2/m to estimate 
the sum 

EA(n)-logn A(n) a/2 



Thus the O-term in ( p.2| ) is bounded by 

1 



<?(logg) r 

Thus we find 



ma/2 ^ -(l-roa/2) 



logo ' J - J - ,/n7 log q 

We split the sum for (J{S)) into terms J eq {S) where we have equality Iljes n i = 
Ili^s Ui ra ther then mere congruence modulo q, and the remaining terms: 

(J(S)) q = J eq {S) + Jcon g (S) + 0{q-^- ma '^) 

with 

(5.3) J eq (S) := (- J-)- E ft^/^) 

ws)H-r-r E n^/(^) 
lo8<? :i 9 « ;t .i, ' x~ lo s« 

5.1. Eliminating congruential terms. We will show that the terms J CO ng(S) are 
negligible: 

J co „ 9 (S) « q-d-'WsH, 

for all e > 0. 

Lemma 5.2. Assume XY = o(q 2 ). Then 

v ^ 1 v'AT 

M=£N 
M=N modg 

and moreover if max(A", Y) = o(q) then the sum is empty, hence equals zero. 

Proof. We may assume X < Y and so certainly X = o(q). Then in the sum we must 
have M < N since otherwise, M = kq + N with k > 1 and so q < M < X = o(q) 
which gives a contradiction. If Y = o(q) then likewise the sum is empty. Thus we 
now assume that Y ^> q. In that case we write N = kq + M, 1 < k < Y/q. Then 
our sum is 

^ S /q ^™ <K ^ ]k, q ^ 



i Vy Vxy 

« E « 



M<X J M * 

as required. □ 
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Lemma 5.3. Assume that r + s = m, and that a < 2/to. Then 

1 \- TT A ("») TT A ( W j) ^ n m<x/2-l+e 



(logg)' 



ni,...,n r .mi,...m a <fj 
FJ ni=rj mj mod g 

n #n ™j 



Proof. We replace the sum over prime powers by the sum over all integers and since 
all variables are bounded by q a , we replace A(n) by logg. Thus our sum is <C than 



X] IT ./tttIT ^ 



ni,...,n r ,mi,...m s <g 
FJ rii=rj mj mod q 

Now set N = Y[ n ii M = Yi m j an d sum separately over those tuples n\, . . . ,n r 
with TV, M fixed. The number of such tuples is <C q e for all e > 0. Thus our sum is 
<C than 

£ -A- 

M<q sa ,N<q ra V 
M=N mod q 



which by Lemma 5.2 is <C q e + ma / 2 1 for all e > 0, since g™ ■ g SQ = o(q 2 ) if 



ma = (r + s)a < 2. □ 

Thus we find that 

(5.4) ((Wf )™) = Je q (S)+0{q- {1 ~ ma/2)+e ) 

SC{l,..,m) 

for all e > 0. 

5.2. Re duc tion to diagonal terms. Fix a subset S C {1, . . . , to}. The sum in 
J eq {S) ( |5.3| ) is over tuples (m, . . . , n m ) which satisfy IljeS n J = l~L^s n »- We say 
that there is a perfect matching of terms if there is a bijection cr of S onto its 
complement S' c in {1, . . . , to} so that rij — fi a (j\, for all j € 5. This can happen 
only if to = 2fc is even and ^S 1 = = k. 
Decompose 

(5.5) J eq (S) — Jdiag(S) + Jnon(S) 

where Jdiag (S) is the sum of matching terms - the diagonal part of the sum (nonex- 
istent for most S), and J n on(S) is the sum over the remaining, nonmatching, terms. 



5.3. Diagonal terms. Assume that to = 2k is even. The diagonal terms are the 
sum over all ( 2 fe fc ) subsets S C {l,...2fc} of cardinality k = to/2 and for each 
such subset S, Jdiag(S) is the sum over all kl bijections a : S — > S c of S onto its 
complement, of terms 

( 1 v A(n)2 fr logw i a ^ k 

\{\ogqY^ n J{ \ogq > J 
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We evaluate each factor by using the Prime Number Theorem: 

1 v A(n) 2 frlogn a 1 f 00 \ogt ? \ogt 2 



(log*?) 2 „ n log<? (logg) 2 J 2 i logg 

(5.6) 

uf(u) 2 du 

Since our function is even and supported inside (— 2/m, 2/m) C (—1,1) (since 
to > 2), we can rewrite this as 



mm(l, \u\)f(u) 2 du =: a(f) 2 /2 



This shows that for to = 2k even we have as q — > oo that 

(2fc)! 
2 fe fc! 

SC{l,...,m) 

Below we will show that the nondiagonal terms J n o n(S ) are negligible, and hence 
by (5.4) and (5.5) we will have thus proved Theorem 5.1. □ 

5.4. Bounding the off-diagonal terms J non {S). We will show that 
Lemma 5.4. 

Jnon(S) <C 



log (J 

Proof. Since 

1 y^ logff 1 y^ logP 1 



log Q p ^ P fc/2 log <? ^ P 3/2 log q 



the contribution of cubes and higher prime powers to (5.3) is negligible, and we 
may assume in J n on(S) that the m are either prime or squares of primes (upto a 
remainder of 0(1/ log q)). By the Fundamental Theorem of Arithmetic, an equality 
Hies n i — YljeS c n i f° rces some of the terms to match, and unless there is a perfect 
matching of all terms, the remaining integers satisfy equalities of the form n\ni = 
with ni = n,2 = p prime and 713 = p 2 a square of that prime. Thus upto a remainder 
of 0(1/ log q), J n0 n{S) is a sum of terms of the form 



J \^ (logp) 2 f A°SP k )2 I 1 \p (logp) 3 y.logP^^ logP 2 ^ 

(log 5 )2 pk J ^i ogq ) 'Uiog^sZ. p 2 n logq ) a logg ; 

fe=l,2 ' 



with 2u + 3u = to, and u > 1 . 

We showed (|5.6f ) that the matching terms have an asymptotic value, hence are 
bounded. We bound the second type of term by 

1 \p (logp) 3 ^logPj2j( logP% <c 1 \p (logp) 3 < 1 
(logg) 3 ^ P 2 logq logq (logg) 3 ^ p 2 (logg) 3 

Thus as long as v > 1 (that is if there is no perfect matching of all term s), we get 



that the contribution of J n0 n(S) is 0(1/ log q). This proves Lemma 5.4 . □ 



LINEAR STATISTICS OF LOW-LYING ZEROS OF L-FUNCTIONS 



13 



6. The random matrix model 

Let f{x) be an even real function subject to the decay condition that there exists 
a fixed e > and A > such that 

(6.1) f(x) < A(l + |x|)-( 1+e ) for all x G R 

Define 

oo 

F N (0):= E f(K 9 + 2 ^)) 

3=— oo 

so that Fn(6) is 27r-periodic. Define 

N 

Z f (U) := E Ftf(Qj) 

where U is an N xN unitary matrix with eigenangles #i , . . . , 6m ■ This is the random 
matrix equivalent of Wt (x) ■ 

The Fourier coefficients of Fn(9) are 

an,N = [ F N (6)e- ine d9 



— 7T 



2tt 

- / f(x)e~ 2 ™ x / N dx 



N 



OO 



N \NJ 



and so, if the matrix U has eigenangles 6\, . . . , 6^, 

N 



Z f {U) :=Y,F N (9j) 



3=1 
oo 1 

(6-2) = E ^) T ^ 

71= — OO 



E {Tr U n } 



N, n = 
0, otherwise 



Since 

we have thus proven 
Theorem 6.1. 

E{Z/} = /(0) 

The definition of Fourier transform we use is such that /(0) = f(x) dx, so 



this Theorem is in perfect agreement with Theorem 3.1 

'OO 

(6.3) (j^ = / mm(\u\,l)f(u) 2 du 



Theorem 6.2. XTie variance of Zt tends to a 2 as N — > oo, where 
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Proof. Since § | |§ 

E{TrZ7"TrZ7 m } = 



iV 2 


if n = to = 







|n| 


if n = — m 


and 


|n| < iV 


N 


if n = —rn 


and 


\n\ > N 





otherwise 







we have 



IE {(*,- 7(0)) }= E ^/ 



n— — oo 
n/0 



min(|u|, l)/(u) 2 dit 

the last line following from the definition of a Riemann integral, and from the fact 
that f(x) is even. □ 



Note that this is the same as the variance of Wf(x) (Theorem 4.1) when / is 
restricted to have support contained in [—1,1]. 



6.1. Mock-Gaussian behaviour. From (6.2) and Theorem 6.1, the mth centered 
moment is 

E{(Z f -E{Z f }) m } 

OO oo - 1 

= E •• E ^)-4^) E ^ ni -^" m > 

n\ — — oo n m =— CO 
m#0 n m #0 

The following two Lemmas will enable us to calculate these moments, under certain 
restrictions on the support of /(«)■ 

Lemma 6.3. IfY^j=i n j ^ then 

Y[TiU r 

3 = 1 

Proof. By rotation invariance of Haar measure, the left hand-side is left unchanged 
by multiplication by Ie 10 (where 9 is an arbitrary angle, and / is the identity 
matrix). Since Tr ((UIe w ) n ) = e in9 TrU n , this means 

!m j / m \ I m 

Y[ TrU n i I = exp i6»E»i E S II TrC/ ™ 3 

which is true only if either both sides are zero, or if Y^JLi n j = 0- ^ 
Lemma 6.4. (Diaconis, Shahshahani || [|]). For a,j,bj S {0, 1,2,...}, if 

(k k N 

E'"< ' E - /7; ' 
3=1 3=1 
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then 



J] (TrU-^f = ,V.,,1J,- „,! 

J = l J j=l 

where 8 a fi = 1 if a>j = bj for j = 1, . . . , k, and 5 a ^ — otherwise. 
Theorem 6.5. For any integer to > 2, z/supp/(u) C [— 2/m, 2/m], then 

if m odd 

if m = 2k, k > 1 an integer 



lim E 



{(^-/(o)) m } = {° 

{ 2 k k\ C 



where a , the variance, is given by ( p.3[ ). 

Proof. The restriction on the support of f(u) gives 



(6.4) E{(^-/(0)) m } 



N> 



2N/m 

• E 

n 1= -2N/r 
ni#0 



2N/m 



E /(^).../(^)E{Tr {/-... TrL/-} 



-2N/r. 



Lemma 6.3 means that to have a non-zero contribution, rij = 0, and so 



\nj\<2N/m 



^\j,1 { „ j>0} 
j'=i 



< 



m 27V 
2 m 



the maximum is obtained by all the positive terms equal to 2N/m, and all the 
negative terms to —2N/m. (This maximum is obtainable only if m is even). Thus 
we see that the support restriction means all the nonzero terms in ( |6.4| ) can be 
calculated using Lemma 6.4. 



To obtain anything nonzero using Lemma 3.4, there must be a bijection a map- 
ping {1, . . . , m} into itself so that rij = — TW j) for all j. Note that no nj can equal 
zero, since this is expressly forbidden in (6.4). 

For odd to, it is impossible to pair off the rij without having at least one rij = 0. 
Therefore ( 3^4 ) is zero for to odd. 

For even to = 2k, assume the nj are such that they can be paired off, and relabel 
so that r\ — nj t where ji is the smallest number such that rij 1 > 0, r2 = n,j 2 where 
j2 is the second smallest number such that n j2 > etc. There are ( 2 fc fe ) ways of 
arranging the positive nj > to give the same . The number of ways of ordering 
the negative rij such that each positive term has a negative partner equals ' ' 
where bi = #{j 



N/k 



N/k 

E-z. 

ri=l J'i- = 1 



n j = — i}. Therefore, after reordering, ( |6.4[ ) equals 

k 



k\ 



h\ b 2 \. 



lUJrU^...\TrU r ^}f[^f(^)f(- 



{I' 



bi! b 2 \. 
-) 



(2k)\ 



k\ 

m 

k\2 k 



(N/k 



16 



CP. HUGHES AND Z. RUDNICK 



since 



1 

61! b 2 \. 



{ 



E<MTrtT 



\TrU' 



} II 



by Lemma OA, and since 

N/k 

E 



N J \N 



l/k 



1/k 



f(u) 



i=i 



du 



2 



when supp/ C [1/fc, 1/fc], where cr 2 is given by (3.3). 



□ 



Remark. One can also prove Theorem (k5 by a completely different method, using 
techniques found in pi]] ; we need to take this route when dealing with the other 
classical groups in M . 



7. Unrestricted moments of Zf(U) 

In this section we will calculate the (uncentered) mth moment of Zf(U) without 
restriction on the support. This allows us to conjecture an extension to Theorems 
3.1 and 5.1. In particular, it appears that the mth centered moment of Wf(x) is 
not Gaussian outside of the range given in Theorem 5.1, which would imply that 
Wf (x) does not converge to a normal distribution. 

We wish to calculate the (uncentered) mth moment of Zf(U). 



M„ 



:= lim 



:{(^/) m } 



A' 



N 



(7.1) 



lim E ■ 

N^oo 



.ii=l 



N \ 



To evaluate M m , we use the r-point correlation function of Dyson: 

Lemma 7.1. (Dyson). For an arbitrary function g of r variables which is 2tt- 
periodic in all its variables, 



E 



N 

E 

ii ,...,i r =l 
ij all distinct 



whe 



with 



(2<T 



9 r ) = det{5 A r(6» i 
sin(A^x/2 



e r )R ( r N \e ll ...,e r ) &e l ...&e r 

~ ®i)}l<i,j<r 



S N (x) 



sin(a:/2) 

Note that the sums in (^J) range unrestrictedly over all variables (they include 
both diagonal and off-diagonal terms), whereas Lemma 7.1 requires the sums to be 
over distinct variables (off-diagonals only). We overcome this problem by summing 
over the diagonals separately. 
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Definition 7.2. a is said to be a set partition of m elements into r non-empty 
blocks if 

a : {l,...,m} — ► {l,...,r} 

satisfying 

(1) For every q 6 {1, . . . , r} there exists at least one j such that o~(j) — q (this 
is the non-emptiness of the blocks). 

(2) For all j , either o~(j) = 1 or there exists a k < j such that cr(j) = o{k) + 1. 
(Roughly speaking, if we think of {1, ... ,r} as denoting ordered pigeonholes, 
then o~(j) either goes into a non-empty pigeonhole, or into the next empty 
hole). 

The collection of all set partitions of m elements into r blocks is denoted P(m, r). 

Remark. The number of a 6 P(m,r) is equal to S(m,r), a Stirling number of 
the second kind. The number of set partitions of m elements into any number of 
non-empty blocks is Y] . S(m,r) = B m , a Bell number. 

Lemma 7.3. For any function g of m variables, 

m 

g(xh,---,x jm ) =^2 E E sK^-^vw) 

Jli — tim r=l a£P(m,r) h,---,ir 

ij all distinct 

Proof. Each term on the LHS appears once and only once on the RHS, so they are 
equal. □ 

Theorem 7.4. 

m . „oo r 

M m = J2 Rr(xi,...,X r ) II^^)^ 

r=lC_ v ^/~°° a£P(m,r) 9=1 

r 

where X q = #{j : o~(J) — q}, and where 

Rr( Xl ,...,x r )=det\ y 3 ^ \ 

I < x i - x i) J l<i,j<r 



Proof. Recall ([7JJ), that 

r N N 

M m = lim E i ^ • • F N(0n). ■ ■ F N (0 im ) 



1 i m —l 



Lemma 7.3 gives 
(7.2) E 




E E E i E F N {6^ 1) )...F N {9 la{m) ) 



r=la£P(m,r) I ii,...,i r =l 
all distinct 



N 



E E E E F N(&n)---F 



N ^lr/ 



r=l irEP(m,r) j ii,...,i r =l 
ij all distinct 
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where X q = #{j : a(j) = q}. Lemma now applies, and gives 

N 



E < 



i\ ,. . ., i r = 1 
ij all distinct 

l 



(27T) 



(9,)... Fft r (OJRW (0i, . . . , 6 r ) d0i . . . d0 r 



1 

iV 7 ' 



AT/2 



7V/2 



27nri 27rx r 



A/ 



iV 



IK* 

9=1 



27rx<j 



dx 



upon change variables to x n — j-9 n . Now, 



j = -oo 



/(X) + O 



/ 1 



uniformly for all x € [—N/2, N/2], due to the decay condition on /, (|3.1|). 
Since 

km _ . L = R Xl . . . x ) 

sui(ir(xj — Xi)) 



where 



R r (xi, . . . , x r ) = dot 



we have 



N 



ii ,...,i r =l 
ij all distinct 



(7.3) limE J2 F^(9 n )...F^(0 lr ) 

•••/ R r (x 1 ,...,x r )Y[f x "(x q ) dx q 

J — OO , 1 



9=1 



Hence, combining (7.1), (7.2) and ( 



(7.4) 



as required. 



{N N \ 

V •■■ V F N (6 il )...F N (9 im ) \ 
— — 
»1=1 im = l J 

£ X! /■■7 ^^i,---,^)ll/ A9 (^) d ^ 



r—1 aeP(m.r) < 



□ 



Remark. One can show that the moments M m of Zf uniquely determine the dis- 
tribution Zf weakly converges to as N — ► oo. 
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8. Application: small first zeros of L(s,x) 

In this section we will apply the results of §§|| and [| to show that, under the 
assumption of GRH there exist Dirichlct ^-functions whose first zero is lower than 
the expected height. Small gaps between high zeros of the Riemann zeta function 
(which also obey unitary statistics) have been much studied. Montgomery 
showed that an infinite number of zeros are less than 0.68 times their average 
spacing. This was reduced to 0.5179 by Montgomery and Odlyzko p2[ ; to 0.5171 by 
Conrey, Ghosh and Gonek ffl; and to 0.5169 by Conrey and Iwaniec, as announced 
in ||. Conrey, Ghosh, Goldston, Gonek and Heath-Brown 0j showed that a positive 
proportion of zeros are less than 0.77 times the average spacing, a result improved to 
0.6878 by Soundararajan (^0|. We should perhaps point out that the main difference 
between gaps between the zeta zeros, and the height of the lowest Dirichlet zero is 
that the point 1/2 in not expected to "repel" low- lying zeros. 



8.1. Infinitely many small first zeros. Using Theorem 3.1 we are able to obtain 



some partial results for extreme low- lying zeros of Dirichlct L- functions. 
Theorem 8.1. Assume GRH. If 

(8.1) lim (W f ) = / f(x) dx 



for all admissible functions f with supp/ C [— 2R, 2R], then 

lim inf min x Y \ < 

where the minimum of the first zero of L{s, \) * s taken over all non-trivial characters 
modulo q. 

Proof. Let g(it) be an even, continuous function, with suppg C [—R,R], and such 
that g(u) is differentiable in \-R,R] and g(x) < \x\~ 3 / 2 ~ s , S > 0. 
Let 



B := 



f °° x 2 g 2 (x) dx _ j^r r o °° g'(u) 2 du 



Jo 9 2 (x) dx y /„ g(u) 2 du 
so, by assumptions on g(u) and its derivative, we see that [3 is a strictly positive 
finite real number. 
Define, for (3 > B, 

f(x) = (x 2 -f3 2 )g 2 (x) 

so that / has the properties 

(8.2) / f(x)dx = -(/3 2 -B 2 ) g 2 (x)dx<0 

Jo Jo 

and 

f{x) | for \x\ | P 

Note that the conditions on g(u) mean that / is an admissible function. 
Observe that 

-1 d 2 

= j£(g'*g')(u)-f3 2 (g*g)(u) 



2(1 
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where (g*g)(u) is the convolution of g with itself. Since differentiation and multipli- 
cation by a constant does not increase the support of a function, we may conclude 
that supp/ C [2R, 2R] since, by assumption, suppg C [-R, R]. 
Therefore, by assumption flS.l| ) and by (8.2), 

1 f°° 

By the assumption of GRH all the x x .j are real, and so we may conclude that there 
exists a go such that for all q > go, 

^E E «<iEEw< Q 

X#Xo j>l X#Xoj>l 
x x j<f3 

and so, for all q > Qq at least one x, a non-trivial character modulo g, exists with 
x x,i — P- (Note that this method produces a non-vacuous result only if j3 < 1, 
since by definition, — * 1.) The Theorem will follow if we can construct a 
g(x) satisfying all the conditions such that B — 1/4R, by letting (3 —> B. 
Taking g(u) — cos (fjf) l{| u |<i?,}, so that 

-4i?cos(27ra;i?) 
9{X > ~ ir(16x 2 R 2 - 1) ' 

we see that 



B 



2 _ dx _ 1 



J °°g 2 {x) dx 16R 2 



Jo 

This concludes the proof of Theorem |3.f| . □ 

Remark. Our choice ofg(u) — cos (f^f) 1{|u|<r} was not an arbitrary one, as this 
is the optimizing function for this method. 

Corollary 8.2. If the Generalised Riemann Hypothesis holds, then 

liminf min x x i < — 

where the minimum of the first scaled zero of L(s,x) is taken over all non-trivial 
characters modulo q. 



Proof. By Theorem B.l we may take R = 1 in Theorem 8.1. □ 
Remark. Random matrix theory suggests that 

lim inf min x x i = 



8.2. Positive proportion of small first zeros. Theorem 8.1 combined with The- 
orem 5.1 allows us to deduce a statement about a positive proportion (rather than 



just infinitely many) of the x have smaller than expected first zeros. 
Theorem 8.3. Assume GRH. For (3 > 0.633, 

liminf — {x ^ Xo : x x ,i < 0} 

q— >oo q — 2 

IItt 2 - 3 - 72(3 2 - 88ir 2 [3 2 - 48/3 4 + 176tt 2 /? 4 



> 



12tt 2 (4/3 2 - l) 2 
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Remark. Random matrix theory suggests that a positive proportion of the \ have 
x Xt \ < f3 for any (3 > 0. 



Proof of Theorem Take 

fp{x) = (x 2 - (3 2 )g 2 {x) 

where 

g(u) = COs(7T1t)l { |„|<i/ 2 } 

(so f(u) has support in [1, 1], and f(x) < for |x| < j3, and f(x) > otherwise). 
As in the proof of Theorem [O] we have 

/oo 
f(x) dx 
-oo 

< for (3 > 1/2. 



By Theorem 5.1 



Bm((w>-<W>>, 



/(«) 



Chebyshev's inequality gives 



limsup— !— #\x^Xo ■ W f -(W f ) >e\ 



q — >oo Q 

and so, using the fact that / is even, 



< 



du 



liminf — !— # <( x 7^ Xo : 

g— >oo g — 2 



where ei = e/2. 

If /3 > 1/2, putting ei = |/ °° f(x) dx\ = \ /(0) 



> 1 



we get 



tx M /(«) 



du 



4e 2 



lim inf = 

g— >oo q — 2 







y>OG 


jx 7^ Xo : 


-2 


/ m dx 






Jo 



<^f(x x j)<0 



> 1 - 



/'i M /(«) 



dit 



(Note we need GRH here, so that X^>i f( x x-j) ^ s rea l)- Since f{x x .j) < implies 
x x j < /3 we may conclude that, after working out the integrals on the right hand 
side. 



lim inf 

q^oc q — 2 



#{x 7^ Xo : £ x ,i < /?} 



3 + it 2 + 72/3 2 - 8tt^ + 48ff 4 + 16n 2 p 4 
12tt 2 (4/3 2 - l) 2 



> 1 
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The right hand side is greater than zero for 



1 y/9 + ll7r2 + 2Vl8 + 667r2 

p > : « 0.633 

2 V11tt 2 -3 

as required. □ 
Remark. The test function we used in the proof is the optimum test function for 



Theorem 8.1, but that does not necessarily make it the optimum test function here. 
Indeed, the word "optimum" is not well defined here, as one can either try to find 
a function that maximises the estimate of the proportion of x satisfying x% jX < (3, 
or one could try to find a function which minimises the (5 for which this method 
proves a positive proportion of Xi )X < (3. 
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